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METHODS AND APPARATUS FOR CODING AND DECODING DATA 

USING REED-SOLOMON CODES 

Field of the Invention 
The present invention relates to coding data and, more particularly, to 
methods and apparatus for Reed-Solomon coding. 

Background of the Invention 
The procedure for transmitting and receiving information using Reed- 
Solomon codes for error detection and correction can be described as follows. 
Information to be transmitted, universally represented as a stream of bits, is 
divided into small blocks of a certain size m. These small blocks of m bits can be 
identified with the elements of a mathematical structure known as finite field (or 
Galois Field) in the sense that there is one-to-one mapping between the set of all 
possible 2 m blocks of m bits and the set of all 2 m elements of the Galois Field 
(GF) of dimension 2 m , GF(2 m ). The Galois field itself can be constructed as the 
set of all polynomials of degree <m with the coefficients from the "ground" 
binary field GF(2) whose elements are 0 and 1 and two field operations, addition 
and multiplication, defined as follows: addition is performed as an "exclusive 
OR" (XOR) operation on the ordered w-tuple of polynomial coefficients and 
multiplication is implemented as polynomial multiplication modulo certain 
"primitive" binary polynomial of degree m as field multiplication, where the 
"primitive" binary polynomial is irreducible (can not be factored further into a 
product of binary polynomials of lower degree) and in addition, does not divide 
any binary polynomial of the form x-l for d<2 m -l. 

One of the most important and common cases is m=8, where each such a 
block is represented as byte and can be conveniently handled by any kind of 
computer, so the following discussion is for the case where bytes are the 
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elements of GF(256), with the understanding that "byte" can be an element, or 
"symbol", of the Galois field of dimension 2 m with m*8. The primitive 
polynomial that allows building of GF(256) is p(x)=x 8 +x 4 +x 3 +x 2 +L 

After setting up the basic units of information (bytes or symbols) and the 
5 corresponding Galois Field (GF), information to be transmitted is divided into 
groups of symbols of equal size k, which is found based upon the desired error 
correction capability of the code. Each such a group is called "message". A 
message is further interpreted as a polynomial M(x) over GF of degree k-1 

10 M(x)= mox 1 "' 1 ^ m 1 x k ' 2 +...+ m k _j (1) 

whose coefficients m, in the ascending order are information symbols in the 
order they were obtained from the original stream of bits treated as GF elements. 
To construct a Reed-Solomon code capable of correcting t errors, a polynomial 
15 G(x) of degree 2/, known as a generator polynomial, is defined according to the 
following formula 

G(x)= (x-a 2ul ) fx-a 2 <- 2 j... (x-a 1 ) (x-a°)= x 2t + g 2t _ } x 2 '' J +... + g 0 (2) 

20 where a denotes the so-called "primitive" element of the GF possessing two 
important (and related) properties: first, it satisfies the equation p(a)=0, and 
second, as a consequence, all non-zero elements of the GF can be expressed as 
powers of the primitive element and form an abelian (or commutative) group 
under the operation of GF multiplication. It is exactly this property that permits 

25 creation of an "anti-log" table that establishes a convenient way to perform 
multiplication of the GF elements using this table. 
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Then the remainder R(x) of the division of message polynomial M(x) multiplied 
by x 2t by the generator polynomial G(x) is found: 



M(x)x 2t = Q(x)*G(x)+R(x), 



(3) 



The remainder is a polynomial of degree 2t-l and finally, 2t coefficients of this 
polynomial are used as check symbols which are appended to k message symbols 
forming a string of k+2t symbols that comprise a "codeword" in systematic form. 
The length of the codeword should not exceed n-1, the number of non-zero 
10 elements in the GF. So the message actually transmitted over a communication 
channel can be represented as the polynomial 



15 where r m are check symbols - redundant symbols that allow detection and 

correction of up to t errors. 

The task of encoding consists, therefore, in performing the division of 

polynomials whose coefficients are symbols of some GF. A conventional 

software approach to deal with calculations in the GF is based on using two look- 
20 up tables where transformations from the polynomial form of the GF symbols to 

exponential form and back are stored. This method can be summarized (for our 

mam case ofGF(2 8 )) as follows: 



M(x) *x 2t +R(x) = M(x) *x 2t +r 2t .j x 2 ' _/ + ... + r 0 



(4) 



25 



1 . Symbols (elements of the GF) in the polynomial form are represented as 
bytes: 



b—(b 7 b 6 bsb4b 3 b 2 bjbQ). 



(5) 
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2. The sum of the symbols is found as a bitwise XOR operation: f+g=j*g. 

3. The primitive element a in polynomial form is represented as 
a= (000000 10), which in polynomial notation corresponds to a(x)=x. 

4. Each non-zero symbol S is primitive element a raised to a certain power 
s=a k . Using polynomial algebra manipulations with binary polynomials, a 
discrete logarithm table logjable and an exponent table expjable are pre- 
calculated to facilitate transformations between exponential (power) and 
polynomial (byte) representation of (non-zero) elements so that 

if a k =b, then k=log table [b] f b=expjable[k] . (6) 

To multiply GF symbols using these tables, the following chain of 
15 operations should be performed assuming operands u and v and result r are in 
byte form: 

GFMUL(u,v) = GFMUL(a logJable[u] , a losJable[v] ) = a ftogJablefuJ + l °g- table M) %255 ^ 

20 r=GFMUL(u,v) = expjable [(logjable [u] + logjable fvj ) % 255 ] (8) 

where % denotes modulo operation (x%255 is the remainder of the division of x 
by 255). 

25 It is clear that the speed of calculation of the result of multiplication over 

GF based on this approach is limited by the fact that multiple access to the tables 
is needed and the address of the entry into the exponent table should be 
calculated by addition. In many cases, access through the modified address 
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register requires 2-3 waiting cycles so the total time of one GF multiplication can 
be 10-12 clock cycles per symbol (byte). This makes software implementation of 
the Reed-Solomon codes a challenging problem - e.g., in a typical case of 
(204,188,16) Reed-Solomon code with 188 data bytes and 16 check bytes to 
5 encode one codeword 188*16*(10-12) ~= 30-36 K cycles are needed, as 

described below. Thus, for example, for the ADSL application with a rate of 
transmission of 4 K Symbols/sec, the total consumption of MIPS for only the 
encoder operation is 120-144 MIPS, which is prohibitively high. 

10 Summary of the Invention 

According to a first aspect of the invention, a method is provided for 
encoding data. The method comprises (a) providing a first table of first 
component values multiplied by code generator coefficients and a second table of 
second component values multiplied by the code generator coefficients, (b) 

15 determining a Galois field element based on a message symbol and a high order 
remainder symbol, (c) separating the Galois field element into first and second 
components, (d) using the first component and a generator coefficient index to 
access one or more first table values in the first table and using the second 
component and the generator coefficient index to access one or more second 

20 table values in the second table, (e) determining a current remainder symbol 

based on first and second table values having equal generator coefficient indices, 
and a previous remainder symbol, (f) performing step (e) for each generator 
coefficient of the code generator to provide a set of remainder symbols, and (g) 
performing steps (b) - (f) for each message symbol in a codeword to provide a 

25 set of final remainder symbols which constitute check symbols to be transmitted 
with the message symbols of the codeword. 

According to a second aspect of the invention, apparatus is provided for 
encoding data. The apparatus comprises a first lookup table containing first 
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component values multiplied by code generator coefficients, a second lookup 
table containing second component values multiplied by the code generator 
coefficients, and a processor. The processor comprises means for determining a 
Galois field element based on a message symbol and a high order remainder 

5 symbol, means for separating the Galois field element into first and second 
components, means for using the first component and a generator coefficient 
index to access one or more first table values in the first lookup table and for 
using the second component and the generator coefficient index to access one or 
more second table values in the second lookup table, and means for determining 

10 a current remainder symbol, based on first and second table values having equal 
generator coefficient indices, and a previous remainder symbol, for each 
generator coefficient of the code generator to provide a set of remainder symbols. 
Each message symbol of a codeword is processed to provide a set of final 
remainder symbols which constitute check symbols to be transmitted with the 

15 message symbols of the codeword. 

Brief Description of the Drawings 
For a better understanding of the present invention, reference is made to 
the accompanying drawings, which are incorporated herein by reference and in 
20 which: 

FIG. 1 is a schematic block diagram of a prior art method for Reed- 
Solomon encoding; 

FIG. 2 is a schematic block diagram that illustrates a method for Reed- 
Solomon encoding, wherein the Galois field element is separated into halfwords 
25 before multiplication by the code generator coefficients; 

FIG. 3 is a schematic block diagram that illustrates Reed-Solomon 
encoding in accordance with an embodiment of the invention; 
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FIGS. 4A and 4B illustrate embodiments of lookup tables for Reed- 
Solomon encoding in accordance with the invention; 

FIG. 5 is a schematic block diagram that illustrates Reed-Solomon 
encoding in a digital signal processor in accordance with an embodiment of the 
5 invention; and 

FIG. 6 is a flow chart that illustrates Reed-Solomon encoding in 
accordance with an embodiment of the invention. 



Detailed Description 

10 To describe a more efficient method of performing the specific task of 

encoding a message according to the Reed-Solomon (RS) coding procedure of 
the invention, a detailed description of the calculations involved is needed. The 
problem is formulated as finding the remainder of the division of the message 
polynomial M(x) multiplied by x 2t by the code generator polynomial G(x). 

15 The remainder can be calculated by iterating the message - increasing it's 

length, shifting already received bytes of the message to the left and moving in 
from the right a new byte, so that at the first step message M 0 (x) includes only 
one byte m 0 > 

20 M 0 (x) = mox°=m 0 , (9) 

and message M } (x) is considered to include two bytes, mi and m 0 with it's first 
byte m 0 shifted to the left by one position, which in polynomial notation can be 
written as 

Mi(x) = mox+rri! = M 0 (x)*x + m } (10) 
It is convenient to introduce the "tail" g(x) of the generator polynomial G(x) 
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G(x)= x 2 '+ g 2t ., x 2 '-'+... + g 0 = x 2 '+ g(x) (11) 
with the property 

5 G(x)+g(x)=x 2 '. ' (12) 

Using this property, at the first step, when only the first symbol of the message is 
encoded (the remainder r 0 is found), the following relations hold: 

10 M 0 (x) x 2 ' = mox 2 ' = Q 0 (x)G(x) + R 0 (x), R 0 (x) = r 0 , 2l -ix 2 '-' + ... + r 0 , 0 (13) 

mox 2 ' = m 0 G + m 0 g (14) 

Therefore, iterations start with 

15 

Qo(x)=m 0 , R 0 =m 0 g. (15) 

After second step, the remainder can be calculated as follows: 

20 Mj(x) x 2 ' - (M 0 (x) *x + m,) x 2t =(x*M 0 (x)*x 2t + mjx 2 ') =x*Q 0 *G + 
(x*R 0 (x) + mjx 2 ') = 

=(Qo*x)*G + (ro, 2t ., x 2t + mj x 2t ) + (r 0 ,2 l .2*x 2i - I +...+r 0 ,o*x) = 

25 =(Qo*x + r 0 . 2l -i+mi)*G + (mj +r 0 . 2t -i)*g(x) + (r 0 ,2 l - 2 *x 2 '' 1 +...+r 0 .o*x+0 *x° 

+), (16) 

therefore, 
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Rj(x) = (m, +r 0 .2i-i)*g(x) + (r w ^x 2 '' 1 + ^+r 0 ,o*x+0*x 0 ). (17) 

Using this approach, in general, the remainder r k+} (x) after step k+1 can be 
5 calculated from the remainder r k (x) at the previous step k and new incoming byte 
m k + } exactly in the same way 

Rm(x) = +r K2t -d*g(x) + fc/-2*^ /+ ...+a^+0Vj 



+r K2t -0*go ■ (18) 



This computation can be implemented using the general notion of "shift register" 
r consisting of 2t symbols 
15 r = r2t-!,r 2 t-2>^ro. (19) 

At each iteration, 2t symbols are shifted to the left with a zero symbol shifting in 
from the right and the left-most symbol r 2t -i shifting out of the register - this left- 
most symbol is added (XOR'ed) with the incoming symbol and forms "feedback 55 
20 symbol 

fk+i =rn k +i +r K 2 ( -h (20) 
Using these definitions, the remainder R k (x) is iterated according to the formula 

25 

Rk*i(x) = (fk+igit-i + r kt2t _2)x 2 '- ] + (f k+ jg 2t -2 + r K2t -3)x 2t ' 2 + ... +f k +igo (21) 
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A schematic diagram representing the iteration of the remainder is shown 
in FIG. 1. In FIG. 1, a message symbol 10 (m s ) is summed with a high order 
remainder 46 (r 3 ) to provide a Galois field element 14 (f). As noted above, 
Galois field addition is performed as an exclusive OR operation. The Galois 
5 field element 14 is multiplied by code generator coefficients 20, 22, 24, and 26 
(go, gi, g2 and g 3 ) to provide intermediate values 30, 32, 34 and 36. The 
intermediate values are summed with previous remainder symbols to provide 
current remainder symbols. Thus, the product fg 0 is summed with zero to provide 
remainder symbol 40 (r 0 ); the product fg! is summed with previous remainder 

10 symbol 40 to provide current remainder symbol 42 (rj); the product fg 2 is 
summed with previous remainder symbol 42 to provide current remainder 
symbol 44 (r 2 ); and the product fg 3 is summed with previous remainder symbol 
44 to provide current remainder symbol 46 (r 3 ). The process is repeated for each 
message symbol 10 in a codeword. After all message symbols in the codeword 

15 have been processed, the remainder symbols 40, 42, 44 and 46 represent check 
symbols for transmission with the message symbols. The codeword thus 
includes message symbols and check symbols. The example of FIG. 1 produces 
four check symbols for transmission with the message symbols and is capable of 
correcting two errors in the codeword. In a more common example, the code 

20 generator includes 16 code generator coefficients gi and produces 16 check 
symbols for correction of eight errors in the codeword. 

It can be seen that when the conventional look-up table approach to the 
calculation of the remainder polynomial is used, the main bottleneck in the flow 
of computation is the repeated computation of the result of multiplication in the 

25 GF. For every byte of the message, 2t multiplications are needed, so to encode 
one codeword with n information bytes and 2t check bytes, 2nt GF 
multiplications have to be performed. 
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In the important case of the Reed-Solomon code correcting 8 errors with 
the symbols from GF(256) t=8, n=239 or less, a total of 3824 GF operations are 
performed. With 10-12 cycles/operation, this amounts to up to 40 K 
cycles/codeword. 

5 In U.S. Patent No. 6,360,348 issued March 19, 2002 to Yang, a lookup 

table is constructed to facilitate calculations based on the observation that all the 
GF operations involved in one iteration of the remainder polynomial are 
performed on the same set of the second operands, namely, 2t coefficients of the 
code generator polynomial g(x). Correspondingly, 2t*n values of the results of 
10 the multiplications of all n elements of GF by 2t coefficients of the code 

generator polynomial g(x) are pre-calculated and stored in the lookup table so 
that the contents of each entry F(fJ) in the table is defined as: 

Ffti) =/0g h (22) 

15 

where / stands for the Galois field element in the polynomial form, g t is a 
coefficient of the code generator polynomial and sign "<g>" denotes Galois field 
multiplication operation. 

If this approach is used, even when the number of the errors corrected is 

20 relatively small, say 8 errors, and the Galois field used is GF(256), the size of the 
table is 4 Kbytes (16*256* lbyte, 16 is the number of check bytes to correct 8 
errors). Often several different options for the RS code should be executed - 
e.g., when using several latency paths in the ADSL application, the codeword in 
one latency path has 16 check bytes and in another latency path has 12 check 

25 bytes, in this case the memory size will be 7 Kb etc. 

According to a feature of the present invention, the GF operation / <8>g h 
where / stands for arbitrary GF element in the polynomial form, g t is a coefficient 
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of the code generator polynomial and sign "<8>" denotes Galois field 
multiplication operation, is performed in two steps using two small tables: 

1) An arbitrary GF element / is first represented as the sum of its "low" and 
"high" parts - e.g., in the GF(256) we can write 

i 

/= HO +0L (23) 



where Hand L are easily obtained from / with two AND operations using high 
and low masks 

10 

H =/&0xF0, L =f& OxOF (24) 

2) The GFMUL operation / 0 gl using properties of the Galois field, can be 
performed as 

15 

/ 0 gi =(H + L) 0 gi = H 0 gi + L 0 gi (25) 

The operations of equations (23) - (25) are illustrated in FIG. 2. As 
shown, Galois field element 14 is separated into a high halfword 50 (HO) and a 

20 low halfword 52 (HL) using high and low masks according to equation (24). The 
high and low halfwords are each multiplied by each of the code generator 
coefficients 20, 22, 24, and 26. The respective products are summed to provide 
intermediate values. The intermediate values are summed with the previous 
remainder symbols to provide current remainder symbols. The architecture of 

25 FIG. 2 is combined with lookup tables to achieve advantageous results. 

3) Using equation (25), two small lookup tables are constructed, each holding 
consecutive results of the GFMUL (Galois field multiplication) of the set 
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of generator polynomial coefficients g t by all 16 possible cases of the 
"low" and "high" halfword: 



TH(H > i) = (HO)®g i 



(26) 



TL(L i i) = (OL)0g i 



(27) 



4) The size of each table in the case of GF(256) and 16 coefficients of the 
code generator polynomial is 256 bytes, and the total size of the two tables 
is 512 bytes - 8 times smaller than in the prior art approach, with the total 
execution time only -50% longer because -50% of the operations are not 
directly connected with the execution of the GFMUL operation. 
Operations involved in the iteration of the remainder by one step (one 
information symbol/byte) can be executed in the following sequence: 

f=m k+I A r w ,_ 7 (28) 



for i=l:2t-l s=TH(H, i)* TL(L, i), n = s* r t . h r 0 = TH(H, 0) A TL(L, 0). (30) 

Each iteration requires two accesses to the lookup tables and subsequent XOR 
operations readily available on any microprocessor. 

Code generation using lookup tables TH and TL is illustrated 
schematically in FIG. 3. As indicated above, a first lookup table contains the 
values of high halfword HO multiplied by each of the code generator coefficients 
and a second lookup table contains the values of low halfword 0L multiplied by 
each of the code generator coefficients. The high halfword value is used with a 



f=H0 + 0L 



(29) 



745703.1 



- 14- 

coefficient index to access table TH to provide one or more high table values 54, 
and the low halfword value is used with the coefficient index to access table TL 
to provide one or more low table values 56. Respective high and low table 
values with the same coefficient index are summed in summing operation 60 to 
5 provide intermediate values, and the intermediate values are summed with the 
respective previous remainder symbols in summing operation 62 to provide 
current remainder symbols 40, 42, 44 and 46. As discussed below, the lookup 
tables can provide a single set of table values to be summed or two or more sets 
of table values to be summed, depending on the architecture of the system 

10 performing code generation. 

The content of the lookup tables in accordance with an embodiment of the 
invention is shown in FIGS. 4A and 4B. FIG. 4A illustrates a high halfword 
table 70 (TH), and FIG. 4B illustrates a low halfword table 72 (TL). As shown, 
table 70 contains the products of each possible value of the high halfword HO 

15 multiplied by each of the code generator coefficients. Thus, each entry in table 
70 contains a value of g,H, where / ranges from 0 to 2t-l and H ranges from 0 to 
the maximum value of high halfword H. Similarly, each entry in table 72 
contains a value of g,L, where / ranges from 0 to 2t-l and L ranges from 0 to the 
maximum value of low halfword L. For the common case where the message 

20 symbols are bytes, the high halfword has 16 possible values ranging from 00 to 
F0 (using hexadecimal notation). The number of code generator coefficients 
depends on the number of desired check symbols and may be 16 for the case of 
eight check symbols. Similarly, table 72 contains the products of each possible 
value of the low halfword 0L multiplied by each of the code generator 

25 coefficients. Where the message symbols are bytes, the low halfword values 
range from 0 to F. .Thus, for the case where the message symbols are bytes and 
the code generator has 16 coefficients, each of lookup tables 70 and 72 has 256 
entries. In the example of FIGS. 4A and 4B, tables 70 and 72 are organized such 
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that each row contains the products corresponding to one halfword value. As 
discussed below, this organization facilitates parallel execution in a typical 
processor. 

The invention has been described in connection with embodiments where 
5 an eight-bit Galois field element is divided into high and low halfwords and two 
lookup tables are utilized. In other embodiments, each Galois field element may 
have more than eight bits. Furthermore, each Galois field element may be 
separated into more than two components and more than two lookup tables may 
be utilized. Consider a 16-bit Galois field element. The approach described in 

10 Patent No. 6,360,348 requires a 2 M byte lookup table. In an embodiment where 
each 16-bit Galois field element is separated into two components, two lookup 
tables, each of 8 K bytes (16 coefficients x 256 half GF element values x 2 bytes 
per entry), are required. In an embodiment where each 16-bit Galois field 
element is separated into four components, four lookup tables, each of 512 bytes 

15 (16 coefficients x 16 quarter GF element values x 2 bytes per entry), are required. 

For any DSP (digital signal processor) with a data access word size more 
than 8 bits wide, which is the case for all modem DSPs, an additional advantage 
of this embodiment can be achieved simply because of the fact that table accesses 
can be conducted with a monotonically increasing second index, so the table can 

20 be used as one-dimensional array with sequential access, and shift register cells 
are used in a strictly sequential order. This fact greatly speeds up the execution, 
first because multiple entries are read in one cycle, and second because the XOR 
operation can also be performed on the multiple Galois field symbols 
simultaneously - e.g., 4 symbols at once when implemented on DSPs with a 32- 

25 bit data word path, 8 symbols for DSPs with a 64-bit data word path, etc. For 
example, one particular implementation on the Blackfin DSP, which has a 32-bit 
data path with certain restrictions, works with a speed close to 1.5 cycle/check 
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byte/data byte, which is 5-6 times faster than the conventional method and even 2 
times faster than a DSP having a built-in Galois field multiplier. 

A block diagram of a digital signal processor for encoding data according 
to an embodiment of the invention is shown in Fig. 5. A process for encoding 

5 data in a digital signal processor having a word width of w Galois field elements, 
according to an embodiment of the invention, is shown in Fig. 6. In the 
embodiment of Fig. 5, a core processor 100 of the digital signal processor 
receives message symbols m and supplies check symbols r to be transmitted with 
the message symbols of a codeword. The core processor 100 accesses a high 

10 halfword lookup table 110 and receives table values giH, g i+ iH, gi+ 2 H and gi +3 H. 
The core processor 100 also accesses a low halfword lookup table 112 and 
receives table values giL, gi+iL, g i+2 L and g i+3 L. In the embodiment of Fig. 5, 
message symbols m are bytes and core processor 100 has a word width of 32 bits. 
Accordingly, core processor 100 may process four table values per cycle from 

15 each of lookup table 1 10 and lookup table 1 12 as shown in Fig. 5 and described 
below. Core processors having different word widths may process more or fewer 
table values per cycle. 

Referring again to Fig. 6, a next message symbol m is received and 
processed in step 150. In step 152, a Galois field element f is calculated from the 

20 message symbol and a high order remainder symbol, as shown in Equation (28). 
In step 154, the Galois field element is separated into first and second 
components, HO and 0L, as shown in Equations (23) and (24). In this 
embodiment, the Galois field element is separated into high and low halfwords. 
However, in other embodiments, the Galois field element may be separated into 

25 more than two components. 

In step 156, the high halfword HO and an index i are used to access w table 
values from lookup table 110 (Fig. 5). The index i is an index of generator 
coefficient g*. In the example where the codeword has 16 check symbols and the 
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code generator polynomial has 16 coefficients, index i ranges from 0 to 15. Also 
in step 156, low halfword OL and index i are used to obtain w table values from 
lookup table 112 (Fig. 5). As indicated above, the number of table values 
accessed simultaneously depends on the word width of core processor 100 

5 expressed in units of Galois field elements. For a typical case where core 

processor 100 has a 32-bit word width and the message symbols are bytes, w=4 
table values are accessed simultaneously in each of lookup tables 110 and 112. 

In steps 160 and 162, a remainder symbol ri is calculated for each of the w 
sets of table values accessed in step 156. More specifically, an intermediate 

10 value Si is calculated in step 160 for each of the w sets of table values as shown in 
Equation (30). A set of table values includes a table value from lookup table 1 10 
and a table value from lookup table 112, the two table values having the same 
generator coefficient index. Then, a current remainder symbol r* is calculated by 
an exclusive OR operation on the intermediate value Si and a previous remainder 

15 symbol r^ for each of the w intermediate values ^ as also shown in Equation 
(30). Thus, w remainder symbols are calculated instep 162. 

In step 170, a determination is made as to whether 2t remainder symbols 
have been calculated, where 2t represents the number of coefficients in the code 
generator polynomial. If 2t remainder symbols have not been calculated, the 

20 index i is incremented by w in step 172. The process then returns to step 156 to 
access w new table values in table 110 and w new table values in table 1 12. The 
loop including steps 156, 160, 162, 170, and 172 is repeated until 2t reminder 
symbols have been calculated. For the example where w=4 and 2t=16, four 
iterations of the loop are required. 

25 When 2t remainder symbols have been calculated, a determination is made 

in step 1 80 as to whether the end of the codeword has been reached. A typical 
codeword may include 188 message symbols. When the end of the codeword has 
not been reached, the process returns to step 150 to process the next message 
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symbol m. When the end of the codeword has been reached, 2t check symbols 
are output in step 182. Thus, for example, 16 check symbols may be output for 
transmission with the 188 message symbols of the codeword. 

In some applications, such as an ADSL modem, the parameters of the RS 
5 code can be set "on the fly" and can be different for different bearer channels 

within one ADSL channel. In this case, two methods of encoding can be realized 
based on the present invention. 

According to the first method, only the table for the highest possible 
number of check bytes is generated, and if a code with less error correction 
10 capability is used, the encoder works in two stages as follows. 

First, it is observed that a code generator polynomial G 2t of a higher degree 
contains as a factor a code generator polynomial G 2s of a lower degree, 



G 2s (x) = (x-a 2 *- 1 )... (x-a°) (31) 

15 

G^^fxV'^ = h 2d (x)G 2s (x)Q2) 

where 

20 h 2d (x)=(x-a 2iJ ).. .(x-a 2s ) (33) 

and d=t-s is the difference between the error-correcting capabilities of the code. 

Correspondingly, a message polynomial of degree n is partitioned into two 
parts - higher n-2d terms and lower 2d terms 

25 

M n (x) =M n . 2d (x)x 2d +M 2d (x) (34) 
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and a shifted message polynomial, into which the code generator polynomial 
should be divided, is represented as 

M„(x)x 2s = M n . 7d (x) x 2d x 2s +M 2d (x)x 2s = M n . 2d (x) x 2s+2d +M 2d (x)x 2s = M n . 2d (x) 
5 x 2t +M 2d (x)x 2s . (35) 

Now the remainder for the first n-2d bytes of the message in relation to the code 
generator polynomial of the higher degree 2t is obtained as 

10 M n . 2d (x)x 2t =Q(x)G 2t (x) + R(x) (36) 

where the first term can be written as 

Q(x)G 2t (x) = Q(x)h 2d (x)G 2s (x)=Q,(x)G 2s (x) (37) 

15 with 

Qj(x) =Q(x)h 2d (x). (38) 

Following the same logic, the "long" remainder R(x) consisting of 2t is broken 
into two parts - higher 2d terms and lower 2s terms 

20 

R(x) = r 2d (x)x 2s + R 2s .j(x) (39) 

and finally the encoding can be performed according to the formula 

25 M n (x)x 2s = M n . 2d (x) x 2 ' +M 2d (x)x 2s , = Q,(x)G 2s (x) + M 2d (x)x 2s + r 2d (x)x 2s + R 2s .,(x) 

(40) 

This is equivalent to the encoding relatively short message (only 2d bytes) 
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(M 2d (x) + r 2d (x))x JS = Q 2 (x)G 2s (x)+R 2 (x) (41) 

and adding the result of this operation with the "shortened" result of the "long" 
5 encoding 

M n (x)x 2s = Q } (x)G 2s (x) + Q 2 (x)G 2s (x)+R 2 (x) + J^fc) =(G,(5c; + Q- 

2(Sc;;g 2 /x;+^(3c; + i^*;; (42) 

10 As a result, the total chain of encoding can be described as follows: the last 

2d bytes of the message M 2d (x) are added (XOR'ed) with the first (left-most) 2d 
bytes of the obtained remainder r 2d (x) and these 2d bytes are treated as a short 
message and are encoded using the conventional method; the result is added to 
the remaining bytes of the "long" remainder R(x) - this is the final result of the 

15 encoding. 

This method has overhead D(t,d) attributed to the additional "reencoding" 
of 2d bytes, which is proportional to the number of bytes encoded and number of 
bytes in the result 

20 D(t,d)=K*2d*2(t-d), (43) 

where K is the number of cycles - usually 10-12 - required for conventional 
encoding. This overhead has a maximum value for d=t/2 K*t 2 , and can be at the 
level of 768 cycles with K=12, t=8, which constitutes —20% of the main part of 
25 the encoding. 

Another option is to generate "right" tables on the fly for any required code 
parameters which can require additional memory. To take full advantage of a 
DSP with a wide data path, however, in this case, according to an aspect of the 
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invention, when the number of check bytes is not an integer multiple of 4, the 
table is generated with a size 32*4k, where 4k is the closest integer multiple of 4 
from above to the specified number of check bytes - e.g., 16 for 14, 12 for 10, 
etc. The remaining 64 bytes are filled by zeros, so when these values are read, 
5 they don't affect the result of the calculation. 

The same tables used in the encoder may be used in the decoder for 
calculating the syndromes in the following manner. The received codeword in a 
systematic form is first "re-encoded" - the encoder mechanism is run on the 
information bytes of the codeword to obtain a "copy" of the sequence of the 
10 check bytes actually received. Then this reconstructed "copy" is bitwise 

compared using a logical AND operator with the actually received check bytes 
forming the polynomial closely related with the syndromes - namely, the values 
of this polynomial on the consecutive powers of the primitive element of the GF 
are exactly the components of the syndrome because of the relation 

15 

M(x)= Q(x)G(x)+R(x), S k =M(a k ) = Q(a k )G(a k )+R(a k ) = R(a k ), k=0J, ...2t-l 

(44) 

because G(a k ) =0 by definition of the code generator polynomial 

20 

G(x)=(x-a 2tJ )... (x-a°) (45) 

So the accumulation of the syndrome can be performed using the 
conventional approach on a much shorter sequence of 2t bytes than the original 
25 codeword of length n bytes. This involves (2t-l) 2 non-trivial GF multiplications 
which is of the same order of operations needed to implement Berlekamp-Massey 
or Euclid algorithm to find the coefficients of the error locator polynomial. 
Because GF multiplications are performed on a fixed set of one of the operands, 
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the speed of this calculations is actually 30-50% higher than the general GF 
multiplication so in the case of a (255,239,16) RS code, at most 225*6=1350 
cycles are required, which is 30 % overhead. 

Having thus described the inventive concepts and a number of exemplary 
5 embodiments, it will be apparent to those skilled in the art that the invention may 
be implemented in various ways, and that modifications and improvements will 
readily occur to such persons. Thus, the examples given are not intended to be 
limiting, and are provided by way of example only. The invention is limited only 
as required by the following claims and equivalents thereto. 
10 What is claimed is: 
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